Record linkage software in the public domain: a comparison of Link Plus, The Link King, and a 'basic' deterministic algorithm

نویسندگان

  • Kevin M. Campbell
  • Dennis Deck
  • Antoinette Krupski
چکیده

The study objective was to compare the accuracy of a deterministic record linkage algorithm and two public domain software applications for record linkage (The Link King and Link Plus). The three algorithms were used to unduplicate an administrative database containing personal identifiers for over 500,000 clients. Subsequently, a random sample of linked records was submitted to four research staff for blinded clerical review. Using reviewers' decisions as the 'gold standard', sensitivity and positive predictive values (PPVs) were estimated. Optimally, sensitivity and PPVs in the mid 90s could be obtained from both The Link King and Link Plus. Sensitivity and PPVs using a basic deterministic algorithm were 79 and 98 per cent respectively. Thus the full feature set of The Link King makes it an attractive option for SAS users. Link Plus is a good choice for non-SAS users as long as necessary programming resources are available for processing record pairs identified by Link Plus.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Rule Your Data with The Link King© (a SAS/AF® application for record linkage and unduplication)

Administrative datasets containing client identifying information (names, birthdates, SSNs) are often used for a variety of research and evaluation projects. The projects often require the linking of two or more independently maintained client rosters in order to track service utilization across different systems. Unfortunately, a given client may be represented with slightly different identify...

متن کامل

Probabilistic Linkage of Persian Record with Missing Data

Extended Abstract. When the comprehensive information about a topic is scattered among two or more data sets, using only one of those data sets would lead to information loss available in other data sets. Hence, it is necessary to integrate scattered information to a comprehensive unique data set. On the other hand, sometimes we are interested in recognition of duplications in a data set. The i...

متن کامل

بررسی وب‌گاه‌های ادارات کل کتابخانه‌های عمومی ایران: مطالعه وب‌سنجی

Purpose: Through analysis of different types of web links, it is aimed in this study to evaluate the status of links in provincial websites of Iran Public Libraries Foundation. Methodology: Link analysis as a webometric method was used in the present research. Data collection was accomplished by LexiURL software and Yahoo search engine. The population under study included the Provincial websit...

متن کامل

The Need for a Strong Public-Private Linkage in Agricultural Extension System (Case Study: Sari Township, Iran)

Relationship between public and private sector is becoming an increasingly important issue in management of agricultural extension services. The need for a strong linkage could be identified as the gap between desirable and current situation. In this research, the differences among current and desirable situation in six diverse dimensions was calculated. The current and desirable situation was ...

متن کامل

Probit-Based Traffic Assignment: A Comparative Study between Link-Based Simulation Algorithm and Path-Based Assignment and Generalization to Random-Coefficient Approach

Probabilistic approach of traffic assignment has been primarily developed to provide a more realistic and flexible theoretical framework to represent traveler’s route choice behavior in a transportation network. The problem of path overlapping in network modelling has been one of the main issues to be tackled. Due to its flexible covariance structure, probit model can adequately address the pro...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Health informatics journal

دوره 14 1  شماره 

صفحات  -

تاریخ انتشار 2008